FACTORIZATION PROBLEM ON THE HILBERT-SCHMIDT 
GROUP AND THE CAMASSA-HOLM EQUATION 



Luen-Chau Li 

ABSTRACT. In this paper, we solve the Camassa-Holm equation for a relatively large 
class of initial data by using a factorization problem on the Hilbert-Schmidt group. 

1. Introduction. 

The Camassa-Holm (CH) equation is a model of long waves in shallow water. 
Since its introduction in 1993 by Camassa and Holm [CH], the equation has re- 
ceived considerable attention and its various aspects were studied using a variety 
of methods. (See, for example, [BF],[BSS], [Con], [CM], [CS], [GH], [M], [XZ] and 
the references therein.) 

In contrast to the KdV equation, the CH equation admits breaking solutions. 
However, a relatively large class of initial data which give rise to global solutions 
has also been identified in [C1] and independently in [Con]. In the paper [C1]
and its more elaborate version [C2], the initial value problem for the CH equation 
was analyzed through its characteristic formulation. It is in this context that an 
isospectral problem in the form of an integro-differential equation was discovered 
for the Lagrangian version of the integrable PDE. This remarkable fact has led, 
in particular, to a particle method for numerically solving the CH equation. (See 
[CHL] for subsequent development of this particle method.) 

The main goal of this work is to show that the integro-differential equation in the 
afore-mentioned papers of Camassa is in fact exactly solvable, in the sense that a 
formula for its solution can be written down. The upshot of this, as the reader will 
see, is that we can integrate both the Lagrangian version and the Eulerian version 
of the CH equation explicitly. The solution of the integro-differential equation
is based on a factorization problem on the Hilbert-Schmidt group which we will
introduce. On the other hand, we will show that the factorization problem on
the Hilbert-Schmidt group can be reduced to solving a family of Fredholm integral 
equations, and this can be achieved by using regularized Fredholm determinants and 
Fredholm first minors. The reader will see that the Lax equation which corresponds 
to the integro-differential equation is in some sense an infinite dimensional analog 
of the Toda flow on $n \times n$ matrices (cf. [DLT]). That such an analog is connected
with the Camassa-Holm equation is rather remarkable and is responsible for the 
elementary method of solving the equation here. 

The paper is organized as follows. In Section 2, we begin by introducing a class of 
integrable isospectral deformations of Hilbert-Schmidt operators on $L^2(\mathbb{R})$ using the
r-matrix approach, then we discuss the underlying Lie groups and coadjoint orbits.
Since the Hilbert-Schmidt operators on $L^2(\mathbb{R})$ are given by integral operators with
kernels in $L^2(\mathbb{R}^2)$, this leads naturally to a class of integrable integro-differential
equations. In particular, for a special choice of Hamiltonian, the corresponding Lax
equation gives rise to the integro-differential equation obtained in [C1], [C2]. In Section 3,
we discuss the solution of the factorization problem on the Hilbert-Schmidt
group and its application towards the integration of the integro-differential equa- 
tion. Finally in Section 4, we show how to apply our results to obtain the explicit 
integration of the Lagrangian version and the Eulerian version of the Camassa-Holm 
equation. 

To close this introduction, we remark that the Lagrangian version of the CH 
equation was also explicitly integrated in [M]. We note, however, that the assump- 
tion on the initial data and the method used in [M] are quite different from the one 
employed here. (See Remark 4.3 for the relationship between the spectral problems.) 

2. Classical r-matrices and integrable integro-differential equations.

In this section, we introduce a class of integrable integro-differential equations 
associated with isospectral deformations of Hilbert-Schmidt operators. 

Let $\mathcal{H}$ be the Hilbert space $L^2(\mathbb{R})$ consisting of real-valued measurable functions $f$
on $\mathbb{R}$ that satisfy $\int_{\mathbb{R}} |f(x)|^2\,dx < \infty$, with inner product

$$(f, g) = \int_{\mathbb{R}} f(x)\,g(x)\,dx, \tag{2.1}$$



and let $\mathfrak{g}$ be the space of Hilbert-Schmidt operators on $\mathcal{H}$. It is well-known [RS] that
$K \in \mathfrak{g}$ if and only if there is a function $K(\cdot,\cdot) \in L^2(\mathbb{R}^2)$, uniquely determined by
$K$, such that

$$(K\varphi)(x) = \int_{\mathbb{R}} K(x,y)\,\varphi(y)\,dy. \tag{2.2}$$

In other words, the Hilbert-Schmidt operators on $\mathcal{H}$ are precisely the integral operators
on $\mathcal{H}$ with $L^2$-kernels. Moreover, for $K \in \mathfrak{g}$, the Hilbert-Schmidt norm is
given by

$$\|K\|_2^2 = \int_{\mathbb{R}^2} |K(x,y)|^2\,dx\,dy. \tag{2.3}$$

Let $B(\mathcal{H})$ be the space of bounded operators on $\mathcal{H}$. We recall that if $T \in B(\mathcal{H})$,
its adjoint $T^*$ is defined by means of the relation

$$(T\varphi, \psi) = (\varphi, T^*\psi) \tag{2.4}$$

for all $\varphi \in \mathcal{H}$ and $\psi \in \mathcal{H}$. Specializing to the case where $A \in \mathfrak{g} \subset B(\mathcal{H})$ with kernel
$A(x,y)$, it is easy to show from (2.4) and Fubini's theorem that its adjoint $A^*$ is
the integral operator on $\mathcal{H}$ with kernel $A^*(x,y) = A(y,x)$. Thus $A^*$ is also in $\mathfrak{g}$. In
what follows, if $A \in \mathfrak{g}$, we shall use the notation

$$A = A(x,y) \tag{2.5}$$

to mean that the integral operator $A$ has kernel $A(x,y)$.

Proposition 2.1. $\mathfrak{g}$ is a Hilbert Lie algebra with Lie bracket $[\cdot,\cdot]$ defined by

$$[A, B] = A \circ B - B \circ A = \int_{\mathbb{R}} \bigl(A(x,z)B(z,y) - B(x,z)A(z,y)\bigr)\,dz, \tag{2.6}$$

and the inner product on $\mathfrak{g}$ is the usual Hilbert-Schmidt inner product $(\cdot,\cdot)_2$, i.e.,

$$(A, B)_2 = \operatorname{tr}(A^* \circ B) = \int_{\mathbb{R}^2} A(x,y)\,B(x,y)\,dx\,dy. \tag{2.7}$$

Moreover, $\mathfrak{g}$ is equipped with the non-degenerate ad-invariant pairing

$$(A, B) = \int_{\mathbb{R}^2} A(x,y)\,B(y,x)\,dx\,dy. \tag{2.8}$$

Proof. Since the space of Hilbert-Schmidt operators is closed under the operations of
addition, subtraction, and composition, it follows that the bracket operation in (2.6)
is well-defined, and it is clear that $[\cdot,\cdot]$ is a Lie bracket. On the other hand, it is
well-known that $\mathfrak{g}$ with the inner product in (2.7) is a Hilbert space. Hence $\mathfrak{g}$ is
a Hilbert Lie algebra. We shall leave the rest of the assertion to the reader as an
exercise. □

In addition to what we have above, we remark that as a special case of a general
theorem (see, for example, [RS]), $\mathfrak{g}$ is a 2-sided ideal in $B(\mathcal{H})$. Now, let $I \in B(\mathcal{H})$
be the identity operator, and let $GL(\mathcal{H})$ denote the group of invertible operators
in $B(\mathcal{H})$. We define

$$G = GL(\mathcal{H}) \cap (I + \mathfrak{g}). \tag{2.9}$$

If $I + K \in G$, it is well-known that $(I + K)^{-1}$ is also in $G$. (See, for example, [Sm].)
Hence $G$ is a group under the composition of operators. As a matter of fact, $G$ is a
Hilbert Lie group which integrates the Lie algebra $\mathfrak{g}$, the Hilbert manifold structure
being determined by the map $G \to \mathfrak{g} : I + K \mapsto K$, which is a bijection onto
the open subset of $\mathfrak{g}$ consisting of Hilbert-Schmidt operators for which $-1$ is not
an eigenvalue. We will call $G$ the Hilbert-Schmidt group. In this case, the adjoint
action of the group $G$ on $\mathfrak{g}$ is given by the formula

$$\mathrm{Ad}_G(g)K = g \circ K \circ g^{-1}. \tag{2.10}$$

Because the pairing on $\mathfrak{g}$ is ad-invariant, we also have

$$(g \circ A \circ g^{-1}, B) = (A, g^{-1} \circ B \circ g). \tag{2.11}$$

On the other hand, the exponential map $\exp : \mathfrak{g} \to G$ is given by the expression

$$\exp(K) = \sum_{j=0}^{\infty} \frac{K^j}{j!}, \tag{2.12}$$

where the powers of $K$ are defined recursively by

$$K^0 = I, \qquad K^j = K \circ K^{j-1}, \quad j = 1, 2, \ldots. \tag{2.13}$$

The Hilbert Lie algebra $\mathfrak{g}$ has two distinguished Lie subalgebras $\mathfrak{l}$ and $\mathfrak{k}$, where
$\mathfrak{l}$ consists of Volterra integral operators $A$ of the form

$$(A\varphi)(x) = \int_{-\infty}^{x} A(x,y)\,\varphi(y)\,dy \tag{2.14}$$

and $\mathfrak{k}$ consists of integral operators $B$ for which

$$B^* = -B. \tag{2.15}$$

We will call $\mathfrak{l}$ the lower triangular subalgebra of $\mathfrak{g}$ and $\mathfrak{k}$ the skew-symmetric
subalgebra.

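Proposition 2.2. We have

$$\mathfrak{g} = \mathfrak{l} \oplus \mathfrak{k}.$$

Proof. Given $K \in \mathfrak{g}$ with kernel $K(x,y)$, it is clear that

$$K = \Pi_{\mathfrak{k}} K + \Pi_{\mathfrak{l}} K,$$

where

$$\Pi_{\mathfrak{k}} K = K(x,y)\,\chi_{(x,\infty)}(y) - K(y,x)\,\chi_{(-\infty,x)}(y)$$

and

$$\Pi_{\mathfrak{l}} K = K(x,y)\,\chi_{(-\infty,x)}(y) + K(y,x)\,\chi_{(-\infty,x)}(y).$$

In the above expressions, $\chi_{(x,\infty)}(y)$ and $\chi_{(-\infty,x)}(y)$ are respectively the
characteristic functions of $(x,\infty)$ and $(-\infty,x)$. Now it is straightforward to check that
$\Pi_{\mathfrak{l}} K \in \mathfrak{l}$ and $\Pi_{\mathfrak{k}} K \in \mathfrak{k}$. Therefore, $\mathfrak{g} = \mathfrak{l} + \mathfrak{k}$. To show that the sum is direct,
suppose $K(\cdot,\cdot)$ is the kernel of an operator which belongs to both $\mathfrak{l}$ and $\mathfrak{k}$. Then
$K(x,y) = 0$ away from the diagonal and hence the corresponding operator is zero.
This completes the proof. □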
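The splitting in Proposition 2.2 has a familiar matrix counterpart, which may help in reading the formulas for the two projections. The following is a minimal sketch, assuming we replace kernels $K(x,y)$ by $n \times n$ real matrices; the function names `proj_k` and `proj_l` are illustrative and not from the paper. One difference is worth noting: for matrices the diagonal is not negligible, and in this sketch it is assigned to the lower triangular part.

```python
import numpy as np

def proj_k(K):
    """Skew-symmetric part: strict upper triangle of K minus its transpose."""
    U = np.triu(K, k=1)
    return U - U.T

def proj_l(K):
    """Lower triangular part (diagonal included in this matrix analog)."""
    return K - proj_k(K)

rng = np.random.default_rng(0)
K = rng.standard_normal((5, 5))
L_part, S_part = proj_l(K), proj_k(K)
```

Here `L_part` plays the role of $\Pi_{\mathfrak{l}} K$ and `S_part` that of $\Pi_{\mathfrak{k}} K$; their sum reproduces `K`.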

Let $\Pi_{\mathfrak{l}}$ and $\Pi_{\mathfrak{k}}$ be the projection operators onto $\mathfrak{l}$ and $\mathfrak{k}$ respectively associated
with the decomposition $\mathfrak{g} = \mathfrak{l} \oplus \mathfrak{k}$. Then it follows from [STS] that

$$R = \Pi_{\mathfrak{l}} - \Pi_{\mathfrak{k}} \tag{2.16}$$

is a classical r-matrix on $\mathfrak{g}$ satisfying the modified Yang-Baxter equation (mYBE)

$$[R(A), R(B)] - R\bigl([R(A), B] + [A, R(B)]\bigr) = -[A, B] \tag{2.17}$$

for all $A, B \in \mathfrak{g}$. Consequently, the formula

$$[A, B]_R = \tfrac{1}{2}\bigl([R(A), B] + [A, R(B)]\bigr), \quad A, B \in \mathfrak{g}, \tag{2.18}$$

defines a second Lie bracket on $\mathfrak{g}$ and we shall denote the associated Lie algebra by
$\mathfrak{g}_R$. In what follows, we shall compute the dual maps of all linear operators on $\mathfrak{g}$
with respect to the pairing $(\cdot,\cdot)$ in (2.8).
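As a sanity check on (2.17), one can test the modified Yang-Baxter equation numerically in the finite dimensional analog. The sketch below assumes the matrix splitting "lower triangular (with diagonal) plus skew-symmetric" in place of $\mathfrak{l} \oplus \mathfrak{k}$; the helper names are hypothetical.

```python
import numpy as np

def proj_k(K):
    """Projection onto skew-symmetric matrices along lower triangular ones."""
    U = np.triu(K, k=1)
    return U - U.T

def R(A):
    """Matrix analog of R = Pi_l - Pi_k, written as identity - 2 * Pi_k."""
    return A - 2.0 * proj_k(A)

def bracket(A, B):
    return A @ B - B @ A

def mybe_residual(A, B):
    """Left side minus right side of (2.17); should vanish up to rounding."""
    lhs = bracket(R(A), R(B)) - R(bracket(R(A), B) + bracket(A, R(B)))
    return lhs + bracket(A, B)

rng = np.random.default_rng(1)
A = rng.standard_normal((4, 4))
B = rng.standard_normal((4, 4))
residual = np.linalg.norm(mybe_residual(A, B))
```

The residual vanishes because both summands of the splitting are Lie subalgebras, which is the only structural fact the general argument uses.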

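Proposition 2.3. If $A \in \mathfrak{g}$ and $A = A(x,y)$, then

$$\Pi_{\mathfrak{k}}^* A = \bigl(A(x,y) - A(y,x)\bigr)\,\chi_{(-\infty,x)}(y),$$

$$\Pi_{\mathfrak{l}}^* A = A(x,y)\,\chi_{(x,\infty)}(y) + A(y,x)\,\chi_{(-\infty,x)}(y).$$

The proof is a straightforward calculation and so we skip the details. We shall
equip $\mathfrak{g}_R^* \simeq \mathfrak{g}$ with the Lie-Poisson structure

$$\{F_1, F_2\}_R(K) = \bigl(K, [dF_1(K), dF_2(K)]_R\bigr), \tag{2.19}$$

where $F_1, F_2 \in C^\infty(\mathfrak{g}_R^*)$, and $dF_i(K) \in \mathfrak{g}$ is defined by the formula
$\frac{d}{dt}\big|_{t=0} F_i(K + tK') = (dF_i(K), K')$, $i = 1, 2$.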

The following result is a consequence of standard classical r-matrix theory. (See 
[STS] and [RSTS] for the general theory.) 

Proposition 2.4. (a) The Hamiltonian equations of motion generated by $F \in C^\infty(\mathfrak{g}_R^*)$
are given by

$$\dot{K} = \tfrac{1}{2}[R(dF(K)), K] - \tfrac{1}{2} R^*[K, dF(K)]. \tag{2.20}$$

In particular, for the Hamiltonian $H_j(K) = \frac{1}{2(j+1)} \operatorname{tr}(K^{j+1})$, $j = 1, 2, \ldots$, the
corresponding equation is the Lax equation

$$\dot{K} = \tfrac{1}{2}[\Pi_{\mathfrak{l}} K^j, K]. \tag{2.21}$$

(b) The family of functions $H_j(K)$, $j = 1, 2, \ldots$ Poisson commute with respect to
$\{\cdot,\cdot\}_R$.




Proof. The Hamiltonian equation of motion (2.20) is obtained from (2.19) by a
direct calculation. On the other hand, by using (2.11), we find that $\mathrm{Ad}^*_G(g^{-1})A =
g \circ A \circ g^{-1}$, $g \in G$. Since $\operatorname{tr}(A \circ B) = \operatorname{tr}(B \circ A)$ for any $B \in B(\mathcal{H})$ and any trace
class operator $A$, it follows that $H_j(\mathrm{Ad}^*_G(g^{-1})K) = H_j(K)$, $g \in G$. By classical
r-matrix theory, we then conclude that the family of functions $H_j(K)$, $j = 1, 2, \ldots$
Poisson commute with respect to $\{\cdot,\cdot\}_R$. The equation of motion for $H_j$ now
follows from (2.20), as the invariance property $H_j(\mathrm{Ad}^*_G(g^{-1})K) = H_j(K)$ implies
$[K, dH_j(K)] = 0$. This completes the proof. □

Let

$$\mathfrak{p} = \{\, K \in \mathfrak{g} \mid K = K^* \,\}. \tag{2.22}$$

Corollary 2.5. (a) $\mathfrak{p}$ is a Poisson submanifold of $(\mathfrak{g}_R^*, \{\cdot,\cdot\}_R)$. Hence eqn. (2.21)
with $K \in \mathfrak{p}$ is Hamiltonian with respect to the induced Poisson structure on $\mathfrak{p}$.
(b) For the Hamiltonian $H_1(K) = \tfrac{1}{4}(K, K)$, the evolution of the kernel $K(x,y;t)$
corresponding to $K$ is given by the integro-differential equation

$$\begin{aligned}
\dot{K}(x,y;t) ={}& \frac{1}{2}\int_{-\infty}^{x} K(x,z;t)K(z,y;t)\,dz - \frac{1}{2}\int_{y}^{\infty} K(x,z;t)K(z,y;t)\,dz \\
&+ \frac{1}{2}\int_{-\infty}^{x} K(z,x;t)K(z,y;t)\,dz - \frac{1}{2}\int_{y}^{\infty} K(x,z;t)K(y,z;t)\,dz.
\end{aligned} \tag{2.23}$$

In the special case when $K$ belongs to the Poisson submanifold $\mathfrak{p}$, the corresponding
kernel $K(x,y;t)$ is symmetric. In this case, the above equation reduces to

$$\begin{aligned}
\dot{K}(x,y;t) &= \int_{-\infty}^{x} K(x,z;t)K(z,y;t)\,dz - \int_{y}^{\infty} K(x,z;t)K(z,y;t)\,dz \\
&= \int_{-\infty}^{y} K(x,z;t)K(z,y;t)\,dz - \int_{x}^{\infty} K(x,z;t)K(z,y;t)\,dz.
\end{aligned} \tag{2.24}$$

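Proof. (a) From (2.20), the Hamiltonian vector field generated by $F$ can be rewritten
in the form

$$X_F(K) = [K, \Pi_{\mathfrak{k}}(dF(K))] - \Pi_{\mathfrak{l}}^*[K, dF(K)].$$

From the expression for $\Pi_{\mathfrak{l}}^*$ in Proposition 2.3, it is clear that the second term in
the above expression is always in $\mathfrak{p}$. On the other hand, if $K \in \mathfrak{p}$, then it is easy
to check that $[K, \Pi_{\mathfrak{k}}(dF(K))]$ is also in $\mathfrak{p}$. Consequently, we have $X_F(K) \in \mathfrak{p}$ for
$K \in \mathfrak{p}$ and this shows that $\mathfrak{p}$ is a Poisson submanifold of $(\mathfrak{g}_R^*, \{\cdot,\cdot\}_R)$.
(b) We have

$$(\Pi_{\mathfrak{l}} K \circ K)\varphi(x) = \int_{\mathbb{R}} \left( \int_{-\infty}^{x} \bigl(K(x,z;t) + K(z,x;t)\bigr) K(z,y;t)\,dz \right) \varphi(y)\,dy$$

while

$$(K \circ \Pi_{\mathfrak{l}} K)\varphi(x) = \int_{\mathbb{R}} \left( \int_{\mathbb{R}} K(x,z;t)\bigl(K(z,y;t) + K(y,z;t)\bigr)\chi_{(-\infty,z)}(y)\,dz \right) \varphi(y)\,dy.$$

Therefore, by (2.21) with $j = 1$, the evolution of the kernel $K(x,y;t)$ is given by

$$\begin{aligned}
\dot{K}(x,y;t) ={}& \frac{1}{2}\int_{-\infty}^{x} K(x,z;t)K(z,y;t)\,dz - \frac{1}{2}\int_{y}^{\infty} K(x,z;t)K(z,y;t)\,dz \\
&+ \frac{1}{2}\int_{-\infty}^{x} K(z,x;t)K(z,y;t)\,dz - \frac{1}{2}\int_{y}^{\infty} K(x,z;t)K(y,z;t)\,dz.
\end{aligned}$$

□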

For our next remark and the discussion in Section 3, we introduce the Lie subalgebra
$\mathfrak{u}$ of $\mathfrak{g}$ which consists of Volterra integral operators $B$ of the form

$$(B\varphi)(x) = \int_{x}^{\infty} B(x,y)\,\varphi(y)\,dy. \tag{2.25}$$

Remark 2.6. (a) From the definition of $\mathfrak{k}$ and $\mathfrak{l}$, and from equation (2.21), it is
clear that what we are dealing with here is in some sense an infinite dimensional
analog of the Toda flows on $n \times n$ matrices (cf. [DLT]).

(b) A different decomposition of the Hilbert Lie algebra $\mathfrak{g}$ is given by

$$\mathfrak{g} = \mathfrak{l} \oplus \mathfrak{u} \tag{2.26}$$

with associated projection maps $\Pi_- : \mathfrak{g} \to \mathfrak{l}$ and $\Pi_+ : \mathfrak{g} \to \mathfrak{u}$. (We will use these
projection maps in Section 3.) Indeed, if we consider the r-matrix $R = \Pi_- - \Pi_+$
associated with this splitting and equip the corresponding $\mathfrak{g}_R^*$ with the Lie-Poisson
structure, then the evolution of the general kernel $K(x,y;t)$ under the Hamiltonian
flow generated by $\tfrac{1}{2}(K, K)$ is given by the equation

$$\begin{aligned}
\dot{K}(x,y;t) &= \int_{-\infty}^{x} K(x,z;t)K(z,y;t)\,dz - \int_{y}^{\infty} K(x,z;t)K(z,y;t)\,dz \\
&= \int_{-\infty}^{y} K(x,z;t)K(z,y;t)\,dz - \int_{x}^{\infty} K(x,z;t)K(z,y;t)\,dz.
\end{aligned}$$

Clearly, eqn. (2.24) is a special case of this. Note, however, that $\mathfrak{p}$ is not a Poisson
submanifold any more with this choice of r-matrix and the corresponding Lie-Poisson
structure. Indeed, the Hamiltonian vector field generated by a general
function $F$ is now of the form $[K, \Pi_+(dF(K))] - \Pi_-^*[K, dF(K)]$. Clearly, this is
not necessarily in $\mathfrak{p}$ for $K \in \mathfrak{p}$. Thus from the Hamiltonian point of view, the
r-matrix in (2.16) is the correct choice. For the relation between (2.24) and the
Camassa-Holm equation, and the coadjoint orbit picture, we refer the reader to
the discussion in Section 4 preceding Remark 4.1, Remark 4.2 and Proposition 2.8
below for details.

In the rest of the section, we shall describe the symplectic leaves of the Lie-Poisson
structure $\{\cdot,\cdot\}_R$, which are given by the coadjoint orbits of the infinite
dimensional Lie group $G_R$ which integrates $\mathfrak{g}_R$. In particular, we shall consider the
coadjoint action of $G_R$ on the class $\mathfrak{p}^*$ of Hilbert-Schmidt operators $K \in \mathfrak{g}$ with
so-called single-pair kernels [GK]. By definition, a Hilbert-Schmidt operator $K \in \mathfrak{p}^*$
if and only if its kernel is of the form

$$K(x,y) = \begin{cases} a(x)\,b(y), & x \le y, \\ a(y)\,b(x), & x > y, \end{cases} \tag{2.27}$$

where $a$ and $b$ are functions on $\mathbb{R}$. (Note that $a$ and $b$ are not necessarily in $L^2(\mathbb{R})$.)
In order to describe $G_R$, we begin by introducing the Lie subgroups (see Remark
2.7)

$$\mathcal{L} = I + \mathfrak{l}, \qquad \mathcal{K} = \{\, k \in G \mid k \circ k^* = k^* \circ k = I \,\} \tag{2.28}$$

of $G$ which correspond to the Lie algebras $\mathfrak{l}$ and $\mathfrak{k}$.

Remark 2.7. It is clear that the group operation of $G$ is closed on $\mathcal{L}$. On the other
hand, if $A \in \mathfrak{l}$, we can show that the Neumann series $\sum_{j=0}^{\infty} (-1)^j A^j$ converges by
using the estimate $\|A^{j+1}\| \le \|A\|^{j+1} / \sqrt{(j-1)!}$, $j = 1, 2, \ldots$, which we can derive
from eqn. (8), Section 2.7 of [Sm] provided we interpret the inequality there as being
valid almost everywhere under our weaker assumption. Thus $I + A$ is invertible
and $(I + A)^{-1} = \sum_{j=0}^{\infty} (-1)^j A^j \in \mathcal{L}$. This shows that $\mathcal{L}$ is a subgroup of $G$. That
$\mathcal{L}$ is a Lie subgroup of $G$ now follows since the former is clearly a submanifold of
the latter.



Let

$$G_R = \{\, g \in G \mid g = g_- \circ g_+^{-1}, \text{ where } g_- \in \mathcal{L},\ g_+ \in \mathcal{K} \,\}. \tag{2.29}$$

Then following the procedure in [DLT], we can endow $G_R$ with a Lie group structure
by defining the multiplication

$$g * h = g_- \circ h \circ g_+^{-1} \tag{2.30}$$

and we can show that $(G_R, *)$ is a Lie group which corresponds to the Lie algebra
$\mathfrak{g}_R$. Moreover, the adjoint action of $G_R$ on $\mathfrak{g}_R$ is given by

$$\mathrm{Ad}_{G_R}(g)K = g_- \circ \Pi_{\mathfrak{l}} K \circ g_-^{-1} + g_+ \circ \Pi_{\mathfrak{k}} K \circ g_+^{-1}. \tag{2.31}$$

Hence an easy computation using (2.11) shows that

$$\mathrm{Ad}^*_{G_R}(g^{-1})K = \Pi_{\mathfrak{l}}^*(g_- \circ K \circ g_-^{-1}) + \Pi_{\mathfrak{k}}^*(g_+ \circ K \circ g_+^{-1}) \tag{2.32}$$

and the symplectic leaves of $\{\cdot,\cdot\}_R$ are the orbits of this coadjoint action.



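Proposition 2.8. The class $\mathfrak{p}^* \subset \mathfrak{p}$ of Hilbert-Schmidt operators with single-pair
kernels is invariant under $\mathrm{Ad}^*_{G_R}$.

Proof. Take $K \in \mathfrak{p}^*$,

$$K = K(x,y) = \begin{cases} a(x)\,b(y), & x \le y, \\ a(y)\,b(x), & x > y. \end{cases}$$

Then for $g = g_- \circ g_+^{-1} \in G_R$, it is clear that $g_+^{-1} \circ K \circ g_+ \in \mathfrak{p}$. Therefore,
$\Pi_{\mathfrak{k}}^*(g_+^{-1} \circ K \circ g_+) = 0$, so that

$$\mathrm{Ad}^*_{G_R}(g)K = \Pi_{\mathfrak{l}}^*(g_-^{-1} \circ K \circ g_-).$$

Now, by a straightforward computation using the form of $K(x,y)$ above and the
fact that $g_- \in \mathcal{L}$, we can show

$$\Pi_{\mathfrak{l}}^*(g_-^{-1} \circ K \circ g_-) = \begin{cases} (g_-^{-1}a)(x)\,(g_-^* b)(y), & x \le y, \\ (g_-^{-1}a)(y)\,(g_-^* b)(x), & x > y, \end{cases}$$

from which we conclude that $\mathrm{Ad}^*_{G_R}(g)K \in \mathfrak{p}^*$, as asserted. □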



From this result, it follows that the coadjoint orbit of $G_R$ through an element
$K \in \mathfrak{p}^*$ will consist entirely of elements from $\mathfrak{p}^*$. In particular, this means that if
the initial data of (2.24) is a single-pair kernel, then $K(x,y;t)$ is also a single-pair
kernel for all $t$. This is the fact which underlies the geometry behind our application
in Section 4 below.

Remark 2.9. (a) As the reader will see in Theorem 3.1, every $g \in G$ admits a
unique factorization $g = g_- \circ g_+^{-1}$ with $g_- \in \mathcal{L}$ and $g_+ \in \mathcal{K}$. Thus the underlying
manifold of $(G_R, *)$ is just $G$ in this case. Note that this factorization result in
Theorem 3.1 is nothing but the global version of the decomposition in Proposition
2.2.

(b) As a final remark of this section, we would like to point out that everything we
have done in this section can be pushed through in the more general setting when
the Hilbert space is taken to be $L^2(\mathbb{R}, d\mu)$ for an arbitrary Borel measure $\mu$ on $\mathbb{R}$.



3. Solution by factorization. 

We recall the Lie subgroups

$$\mathcal{L} = I + \mathfrak{l}, \qquad \mathcal{K} = \{\, k \in G \mid k \circ k^* = k^* \circ k = I \,\} \tag{3.1}$$

of $G$ introduced in Section 2. As the reader will see, they play an important role
in the solution of (2.24). In order to discuss the factorization problem, let us also
recall several formulas from the theory of regularized determinants which we are
going to need in our context. The reader is referred to [S], [Sm] for more details.
Let $A$ be a Hilbert-Schmidt operator on $\mathcal{H} = L^2(\mathbb{R})$; then

$$\mathcal{R}_2(A) := (I + A)e^{-A} - I \tag{3.2}$$

is of trace class. Following [S], we can define the regularized determinant

$$\det\nolimits_2(I + A) := \det(I + \mathcal{R}_2(A)) := \sum_{k=0}^{\infty} \operatorname{tr} \Lambda^k(\mathcal{R}_2(A)), \tag{3.3}$$

which obeys the estimate

$$|\det\nolimits_2(I + A)| \le \exp\bigl(\|\mathcal{R}_2(A)\|_1\bigr), \tag{3.4}$$

where $\|\mathcal{R}_2(A)\|_1 = \operatorname{tr}\bigl(\sqrt{\mathcal{R}_2(A)^* \mathcal{R}_2(A)}\bigr)$. Suppose $I + K \in G$; then the analog of
the first Fredholm minor is given by

$$D_2(K) = -K(I + K)^{-1} \det\nolimits_2(I + K) \tag{3.5}$$

and so we have the formula

$$(I + K)^{-1} = I + \frac{D_2(K)}{\det\nolimits_2(I + K)}. \tag{3.6}$$
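For matrices, the regularized determinant reduces to $\det_2(I + A) = \det(I + A)\,e^{-\operatorname{tr} A}$, so the relation between (3.5) and (3.6) can be checked directly. The following is a minimal sketch under that finite dimensional assumption; the function names are illustrative only.

```python
import numpy as np

def det2(A):
    """Regularized determinant det_2(I + A) = det(I + A) * exp(-tr A) for a matrix A."""
    n = A.shape[0]
    return np.linalg.det(np.eye(n) + A) * np.exp(-np.trace(A))

def first_minor(K):
    """Matrix analog of (3.5): D_2(K) = -K (I + K)^{-1} det_2(I + K)."""
    n = K.shape[0]
    return (-K @ np.linalg.inv(np.eye(n) + K)) * det2(K)

rng = np.random.default_rng(2)
K = 0.1 * rng.standard_normal((4, 4))   # small norm, so I + K is invertible
I = np.eye(4)
lhs = np.linalg.inv(I + K)               # left side of (3.6)
rhs = I + first_minor(K) / det2(K)       # right side of (3.6)
```

The two sides agree because $-K(I+K)^{-1} = (I+K)^{-1} - I$, which is all that (3.6) encodes.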
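In connection with (3.6) above, it is important to note the Plemelj-Smithies formulas

$$\det\nolimits_2(I + K) = 1 + \sum_{m=1}^{\infty} \alpha_m^{(2)}(K) \tag{3.7}$$

and

$$D_2(K) = -K - \sum_{m=1}^{\infty} \beta_m^{(2)}(K). \tag{3.8}$$

Here,

$$\alpha_m^{(2)}(K) = \frac{1}{m!} \det \begin{pmatrix}
0 & m-1 & 0 & \cdots & 0 \\
\sigma_2(K) & 0 & m-2 & \cdots & 0 \\
\vdots & \vdots & \ddots & \ddots & \vdots \\
\sigma_{m-1}(K) & \sigma_{m-2}(K) & \sigma_{m-3}(K) & \cdots & 1 \\
\sigma_m(K) & \sigma_{m-1}(K) & \sigma_{m-2}(K) & \cdots & 0
\end{pmatrix} \tag{3.9}$$

and

$$\beta_m^{(2)}(K) = \frac{1}{m!} \det \begin{pmatrix}
K & m & 0 & \cdots & 0 \\
K^2 & 0 & m-1 & \cdots & 0 \\
K^3 & \sigma_2(K) & 0 & \cdots & 0 \\
\vdots & \vdots & \vdots & \ddots & \vdots \\
K^m & \sigma_{m-1}(K) & \sigma_{m-2}(K) & \cdots & 1 \\
K^{m+1} & \sigma_m(K) & \sigma_{m-1}(K) & \cdots & 0
\end{pmatrix}, \tag{3.10}$$

where

$$\sigma_j(K) = \operatorname{tr}(K^j), \quad j \ge 2. \tag{3.11}$$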
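The series for $\det_2(I + K)$ can be compared numerically with the closed form available for matrices. The sketch below assumes the standard Plemelj-Smithies array (zero diagonal, superdiagonal entries $m-1, m-2, \ldots, 1$, and $\sigma_{i-j+1}$ below the diagonal) and the matrix identity $\det_2(I+K) = \det(I+K)e^{-\operatorname{tr}K}$; the function names and the truncation order are illustrative choices.

```python
import math
import numpy as np

def alpha(K, m):
    """(1/m!) * det of the m x m Plemelj-Smithies array with zero diagonal."""
    def sigma(j):
        return np.trace(np.linalg.matrix_power(K, j))
    M = np.zeros((m, m))
    for i in range(m):
        for j in range(i):
            M[i, j] = sigma(i - j + 1)    # sigma_2 next to the diagonal, then sigma_3, ...
        if i + 1 < m:
            M[i, i + 1] = m - 1 - i       # superdiagonal m-1, m-2, ..., 1
    return np.linalg.det(M) / math.factorial(m)

def det2_series(K, order):
    """Truncated series 1 + sum_{m=1}^{order} alpha_m(K)."""
    return 1.0 + sum(alpha(K, m) for m in range(1, order + 1))

rng = np.random.default_rng(3)
K = 0.05 * rng.standard_normal((3, 3))
exact = np.linalg.det(np.eye(3) + K) * np.exp(-np.trace(K))
approx = det2_series(K, 10)
```

For a matrix of small norm, as here, the truncated series and the closed form agree to high accuracy; the low-order terms are $\alpha_1 = 0$, $\alpha_2 = -\sigma_2/2$ and $\alpha_3 = \sigma_3/3$.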



We next introduce a piece of notation. If $K \in \mathfrak{g}$ and $y \in \mathbb{R}$, we shall denote by
$K|_{(-\infty,y)}$ the operator $L^2(-\infty,y) \to L^2(-\infty,y)$ defined by

$$(K|_{(-\infty,y)}\,\varphi)(x) = \int_{-\infty}^{y} K(x,z)\,\varphi(z)\,dz \tag{3.12}$$

for $\varphi \in L^2(-\infty,y)$. Similarly, $(I + K)|_{(-\infty,y)}$ has an analogous meaning.

In order to solve the integro-differential equation (2.24), the following result is 
basic. 

Theorem 3.1. Suppose $I + K \in G$; then $I + K$ has a unique factorization

$$I + K = b_- \circ b_+^{-1}, \tag{3.13}$$

where $b_- \in \mathcal{L}$ and $b_+ \in \mathcal{K}$. If $(b_-^{-1})^* - I = C_+(x,y)$, then explicitly, $C_+(x,y)$ is
given by

$$\begin{aligned}
C_+(x,y) &= -\Bigl( \bigl((I + S)|_{(-\infty,y)}\bigr)^{-1} S(\cdot,y) \Bigr)(x) \\
&= -S(x,y) - \frac{\bigl(D_2(S|_{(-\infty,y)})\,S(\cdot,y)\bigr)(x)}{\det\nolimits_2\bigl((I + S)|_{(-\infty,y)}\bigr)}, \qquad x < y,
\end{aligned} \tag{3.14}$$

and $C_+(x,y) = 0$ for $y < x$, where $S(x,y)$ is the kernel of

$$S = K + K^* + K \circ K^*. \tag{3.15}$$

Proof. By the analog of Remark 2.7,

$$\mathcal{U} = I + \mathfrak{u}$$

is the Lie group corresponding to the Lie algebra $\mathfrak{u}$ introduced at the end of Section
2. For each $y \in \mathbb{R}$, consider the equation

$$C(x,y) + \int_{-\infty}^{y} S(x,z)\,C(z,y)\,dz = -S(x,y), \qquad x < y. \tag{3.16}$$

Since $I + S = (I + K) \circ (I + K)^*$ is positive definite, it follows that $(I + S)|_{(-\infty,y)}$
is invertible. Hence (3.16) has a unique solution given by

$$\begin{aligned}
C_+(x,y) &= -\Bigl( \bigl((I + S)|_{(-\infty,y)}\bigr)^{-1} S(\cdot,y) \Bigr)(x) \\
&= -S(x,y) - \frac{\bigl(D_2(S|_{(-\infty,y)})\,S(\cdot,y)\bigr)(x)}{\det\nolimits_2\bigl((I + S)|_{(-\infty,y)}\bigr)}, \qquad x < y,
\end{aligned}$$

where we have used the formula in (3.6). Set $C_+(x,y) = 0$ for $y < x$ and let $C_+$
denote the corresponding operator in $\mathfrak{u}$. Then from (3.16), we have

$$C_+ + \Pi_+(S \circ C_+) = -S_+, \tag{3.17}$$

where $\Pi_+ : \mathfrak{g} \to \mathfrak{u}$ is the projection operator to $\mathfrak{u}$ relative to the splitting in (2.26)
and $S_+ = \Pi_+ S$. But on the other hand, we find

$$(1 - \Pi_+)(S + S \circ C_+) - S = (I + S) \circ C_+. \tag{3.18}$$

Hence it follows that

$$(I + S) \circ (I + C_+) = I + B_-,$$

where

$$B_- := \Pi_-(S + S \circ C_+) \in \mathfrak{l}. \tag{3.19}$$

Set $b_- = I + B_-$ and $c_+ = I + C_+$. Then $I + S = b_- \circ c_+^{-1}$. But from $(I + S)^* = I + S$,
we also have $I + S = (c_+^{-1})^* \circ b_-^*$. Equating the two expressions for $I + S$, we find

$$c_+^* \circ b_- = b_-^* \circ c_+ \in \mathcal{L} \cap \mathcal{U}.$$

As $\mathcal{L} \cap \mathcal{U} = \{I\}$, we conclude that $c_+ = (b_-^*)^{-1}$ and so we have established the
factorization

$$I + S = b_- \circ b_-^*.$$

Now we define

$$b_+ = (I + K)^{-1} \circ b_-.$$

Then a straightforward verification shows that $b_+ \in \mathcal{K}$. Finally, the uniqueness of
the factors $b_\pm$ is obvious. □
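The structure of the proof above (factor the positive definite operator $I + S = (I+K)\circ(I+K)^*$ as $b_- \circ b_-^*$, then set $b_+ = (I+K)^{-1}\circ b_-$) is mirrored for matrices by the Cholesky factorization. The sketch below is only a finite dimensional illustration with hypothetical names; note that for matrices the triangular factor has a positive diagonal rather than being of the form "identity plus strictly lower triangular".

```python
import numpy as np

def factorize(g):
    """Return (b_minus, b_plus) with g = b_minus @ inv(b_plus),
    b_minus lower triangular and b_plus orthogonal, for an invertible real matrix g."""
    b_minus = np.linalg.cholesky(g @ g.T)    # g g^T = b_- b_-^T, analog of I + S = b_- b_-^*
    b_plus = np.linalg.solve(g, b_minus)     # b_+ = g^{-1} b_-
    return b_minus, b_plus

rng = np.random.default_rng(4)
g = np.eye(4) + 0.1 * rng.standard_normal((4, 4))   # analog of I + K
b_minus, b_plus = factorize(g)
```

Since `b_plus` is orthogonal, its inverse is its transpose, so the factorization reads `g = b_minus @ b_plus.T`.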

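Theorem 3.2. Let $K_0 \in \mathfrak{g}$ and let $b_-(t) \in \mathcal{L}$, $b_+(t) \in \mathcal{K}$ be the unique solution of
the factorization problem

$$\exp\left(-\tfrac{1}{2} t K_0\right) = b_-(t) \circ b_+(t)^{-1}. \tag{3.20}$$

Then for all $t$,

$$K(t) = b_\pm(t)^{-1} \circ K_0 \circ b_\pm(t) \tag{3.21}$$

solves the initial value problem

$$\dot{K} = \tfrac{1}{2}[\Pi_{\mathfrak{l}} K, K] = \tfrac{1}{2}[K, \Pi_{\mathfrak{k}} K], \qquad K(0) = K_0. \tag{3.22}$$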

Proof. We shall present a direct proof of the theorem. First of all, the factorization
problem in (3.20) has unique solutions $b_-(t) \in \mathcal{L}$ and $b_+(t) \in \mathcal{K}$ by Theorem 3.1.
Take

$$K(t) = b_+(t)^{-1} \circ K_0 \circ b_+(t).$$

By differentiating the expression, we have

$$\dot{K}(t) = [K(t),\, b_+(t)^{-1} \circ \dot{b}_+(t)]. \tag{$*$}$$

On the other hand, by differentiating (3.20), we find

$$-\tfrac{1}{2} K(t) = b_-(t)^{-1} \circ \dot{b}_-(t) - b_+(t)^{-1} \circ \dot{b}_+(t).$$

Hence by applying $\Pi_{\mathfrak{k}}$ to both sides of the above expression, the result is

$$b_+(t)^{-1} \circ \dot{b}_+(t) = \tfrac{1}{2}\,\Pi_{\mathfrak{k}} K(t).$$

Therefore, on substituting into ($*$), we conclude that

$$\dot{K}(t) = \tfrac{1}{2}[K(t), \Pi_{\mathfrak{k}} K(t)].$$

This shows that $K(t) = b_+(t)^{-1} \circ K_0 \circ b_+(t)$ solves the initial value problem. □
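The statement of Theorem 3.2 can be tested numerically in the matrix analog, where it becomes the classical factorization solution of a Toda-type flow. The sketch below assumes a symmetric matrix `K0`, uses a Cholesky factorization for the triangular factor, and compares a central finite difference of $K(t)$ with $\tfrac12[K, \Pi_{\mathfrak{k}}K]$; all names and the step size `h` are illustrative.

```python
import numpy as np

def proj_k(K):
    """Skew-symmetric projection along lower triangular matrices (diagonal included)."""
    U = np.triu(K, k=1)
    return U - U.T

def flow(K0, t):
    """K(t) = b_+(t)^{-1} K0 b_+(t), where exp(-t K0 / 2) = b_-(t) b_+(t)^{-1}."""
    lam, V = np.linalg.eigh(K0)
    g = (V * np.exp(-0.5 * t * lam)) @ V.T     # exp(-t K0 / 2) for symmetric K0
    b_minus = np.linalg.cholesky(g @ g.T)      # lower triangular factor
    b_plus = np.linalg.solve(g, b_minus)       # orthogonal factor
    return b_plus.T @ K0 @ b_plus

rng = np.random.default_rng(5)
A = rng.standard_normal((4, 4))
K0 = 0.5 * (A + A.T)
t, h = 0.3, 1e-5
K = flow(K0, t)
lhs = (flow(K0, t + h) - flow(K0, t - h)) / (2.0 * h)   # approximates dK/dt
rhs = 0.5 * (K @ proj_k(K) - proj_k(K) @ K)             # (1/2)[K, Pi_k K]
```

Up to discretization error the two sides agree, and the eigenvalues of `flow(K0, t)` coincide with those of `K0`, reflecting the isospectral nature of the Lax equation.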

Remark 3.3. (a) The factorization method is the most important feature of clas- 
sical r-matrix theory. It should be emphasized that there is no universal method 
to solve the factorization problems. Rather, the method varies with the Lie groups 
involved. For examples involving finite dimensional matrix groups, the reader is 
referred to [DLT] for further information. On the other hand, factorization prob- 
lems associated with loop groups are related to Riemann-Hilbert problems. See, 
for example, [RSTS] and [DL] in this connection. 

(b) We can also give a geometric proof of Theorem 3.2 by using Poisson reduction.
Indeed, from this point of view, we can understand the Hamiltonian flow in (3.21)
as the projection of some simple Hamiltonian flow on the cotangent bundle $T^*G$.
We will give a sketch of the argument here, following essentially the outline on p.
180 of [STS]. Let $(G_R, *)$ be the Lie group introduced in Section 2 with Lie algebra
$\mathfrak{g}_R$. Consider the action of $G_R$ on $G$,

$$G_R \times G \to G : \quad g \cdot h = g_- \circ h \circ g_+^{-1}, \qquad g \in G_R,\ h \in G. \tag{3.23}$$

We can lift this up to obtain a canonical action on $T^*G$ [AM]. Indeed, this lifted
action is given by

$$G_R \times T^*G \to T^*G : \quad g \cdot (h, K) = \bigl(g_- \circ h \circ g_+^{-1},\ \mathrm{Ad}^*_G(g_+^{-1})K\bigr), \tag{3.24}$$

where we have made the identification $T^*G \simeq G \times \mathfrak{g}^* \simeq G \times \mathfrak{g}$ by using left
translation and the pairing on $\mathfrak{g}$. Consequently, by Poisson reduction, the orbit
space $T^*G/G_R \simeq \mathfrak{g}^* \simeq \mathfrak{g}$ has a unique Poisson structure such that the canonical
projection

$$\pi : T^*G \to T^*G/G_R \simeq \mathfrak{g}, \qquad (g, K) \mapsto \mathrm{Ad}^*_G(g_+)K = g_+^{-1} \circ K \circ g_+ \tag{3.25}$$

is a Poisson map. By a direct computation, we can show that the Poisson structure
on $\mathfrak{g} \simeq T^*G/G_R$ is nothing but $\{\cdot,\cdot\}_R$. Now we consider the bi-invariant
Hamiltonian on $T^*G \simeq G \times \mathfrak{g}$, given by

$$H_1(g, K) = H_1(K). \tag{3.26}$$

Then its Hamiltonian equations of motion are

$$\dot{g} = -\tfrac{1}{2}\, g \circ K, \qquad \dot{K} = 0. \tag{3.27}$$

Therefore, if we denote the corresponding flow by $F_t$, we have in particular that

$$F_t(I, K_0) = \left( \exp\left(-\tfrac{1}{2} t K_0\right),\ K_0 \right). \tag{3.28}$$

Consequently, the Hamiltonian flow generated by the reduced Hamiltonian $H_{\mathrm{red}} =
H_1$ on the orbit space $T^*G/G_R \simeq \mathfrak{g}$ is given by

$$\begin{aligned}
F_t^{\mathrm{red}}(K_0) &= \pi \circ F_t(I, K_0) \\
&= \mathrm{Ad}^*_G(b_+(t))K_0 \\
&= b_+(t)^{-1} \circ K_0 \circ b_+(t),
\end{aligned} \tag{3.29}$$

as required.

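We are now going to write down the solution of the integro-differential equation
(2.24) by combining the above theorems. For this purpose, let $K_0 = K_0(x,y)$,
$K(t) = K(x,y;t)$ and write

$$\exp(-tK_0) = I + S(t), \qquad S(t) = S(x,y;t). \tag{3.30}$$

By Theorem 3.1, the solution $b_-(t)$ of the factorization problem (3.20) is such that

$$(b_-(t)^{-1})^* - I = C_+^t = C_+(x,y;t), \tag{3.31}$$

where

$$\begin{aligned}
C_+(x,y;t) &= -\Bigl( \bigl(e^{-tK_0}|_{(-\infty,y)}\bigr)^{-1} S(\cdot,y;t) \Bigr)(x)\,\chi_{(x,\infty)}(y) \\
&= -\left( S(x,y;t) + \frac{\bigl(D_2(S(t)|_{(-\infty,y)})\,S(\cdot,y;t)\bigr)(x)}{\det\nolimits_2\bigl(e^{-tK_0}|_{(-\infty,y)}\bigr)} \right) \chi_{(x,\infty)}(y).
\end{aligned} \tag{3.32}$$

On the other hand, it follows from the proof of Theorem 3.1 (see (3.19) and the
definition of $b_-$) that

$$b_-(t) = I + \Pi_-\bigl(S(t) + S(t) \circ C_+^t\bigr) \tag{3.33}$$

and hence

$$b_-(t) - I = \left( S(x,y;t) + \int_{-\infty}^{y} S(x,z;t)\,C_+(z,y;t)\,dz \right) \chi_{(-\infty,x)}(y). \tag{3.34}$$

Consequently, from

$$\begin{aligned}
K(t) &= b_-(t)^{-1} \circ K_0 \circ b_-(t) \\
&= K_0 + (b_-(t)^{-1} - I) \circ K_0 + \bigl(K_0 + (b_-(t)^{-1} - I) \circ K_0\bigr) \circ (b_-(t) - I)
\end{aligned} \tag{3.35}$$

and (3.31), (3.34), we find

$$\begin{aligned}
K(\xi,\eta;t) ={}& K_0(\xi,\eta) + \int_{-\infty}^{\xi} C_+(\zeta,\xi;t)\,K_0(\zeta,\eta)\,d\zeta \\
&+ \int_{\eta}^{\infty} \left( K_0(\xi,\zeta_2) + \int_{-\infty}^{\xi} C_+(\zeta_1,\xi;t)\,K_0(\zeta_1,\zeta_2)\,d\zeta_1 \right) \\
&\qquad \cdot \left( S(\zeta_2,\eta;t) + \int_{-\infty}^{\eta} S(\zeta_2,\zeta_3;t)\,C_+(\zeta_3,\eta;t)\,d\zeta_3 \right) d\zeta_2.
\end{aligned} \tag{3.36}$$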

Remark 3.4. In a similar fashion, we can write down the solution of the integro-differential
equation (2.23). Indeed, all we have to do is to replace $e^{-tK_0}$ in (3.30)
and (3.32) by $\exp\left(-\tfrac{1}{2} t K_0\right) \circ \exp\left(-\tfrac{1}{2} t (K_0)^*\right)$. The operator $S(t)$ and its kernel
$S(x,y;t)$ are of course much more complicated in this case.

4. Solution of the Camassa-Holm equation. 

In this section, we shall consider the Camassa-Holm (CH) equation in the non-dispersive
case:

$$u_t - u_{xxt} + 3uu_x = 2u_x u_{xx} + uu_{xxx}. \tag{4.1}$$

We will begin with a sketch of the connection between (4.1) and the integro-differential
equation (2.24), as first discovered by Camassa [C1]. To do so, we
introduce the auxiliary variable

$$m(x,t) = (1 - \partial_x^2)\,u(x,t) \tag{4.2}$$

and rewrite the CH equation in the form

$$m_t + u m_x = -2 m u_x. \tag{4.3}$$

We shall make the following assumptions on the initial data:

(i) $u_0 = u(\cdot, 0)$ is in the Schwartz class $\mathcal{S}(\mathbb{R})$,

(ii) $m(x,0) = u_0(x) - u_0''(x) > 0$ for all $x \in \mathbb{R}$.

In the Lagrangian point of view of the CH equation, we consider the trajectory
$q(\xi,t)$ of a fluid particle which at $t = 0$ is located at $\xi \in \mathbb{R}$. Thus we have

$$\dot{q}(\xi,t) = u(q(\xi,t), t), \qquad q(\xi,0) = \xi, \tag{4.4}$$

and a straightforward calculation (see [C1], [Con]) using (4.3) and (4.4) shows that

$$m(q(\xi,t),t)\,\bigl(q_\xi(\xi,t)\bigr)^2 = m(\xi,0). \tag{4.5}$$

Now it follows from [C1], [C2] and [Con] that for the class of initial data introduced
above, we have

$$0 < q_\xi(\xi,t) < \infty \tag{4.6}$$

for all $t > 0$. In particular, this means that the trajectories of the fluid particles
never cross.

Therefore, if we define

$$w(x,t) = \int_{-\infty}^{x} \sqrt{m(y,t)}\,dy, \tag{4.7}$$

then

$$w(q(\xi,t),t) = w(\xi,0) = w_0(\xi), \tag{4.8}$$

so that we can rewrite (4.5) as

$$m(q(\xi,t),t) = \left( \frac{w_0'(\xi)}{q_\xi(\xi,t)} \right)^2. \tag{4.9}$$

By introducing the auxiliary function

$$p(\xi,t) = m(q(\xi,t),t)\,q_\xi(\xi,t) = \frac{(w_0'(\xi))^2}{q_\xi(\xi,t)} \tag{4.10}$$

and using the formula

$$u(x,t) = \frac{1}{2}\int_{\mathbb{R}} e^{-|x - q(\eta,t)|}\,m(q(\eta,t),t)\,q_\eta(\eta,t)\,d\eta, \tag{4.11}$$

it follows from (4.4) and (4.9) that

$$\dot{q}(\xi,t) = \frac{1}{2}\int_{\mathbb{R}} e^{-|q(\xi,t) - q(\eta,t)|}\,p(\eta,t)\,d\eta = \frac{\delta H}{\delta p} \tag{4.12}$$

and hence

$$\dot{p}(\xi,t) = \frac{1}{2}\,p(\xi,t)\int_{\mathbb{R}} \operatorname{sgn}(\xi - \eta)\,e^{-|q(\xi,t) - q(\eta,t)|}\,p(\eta,t)\,d\eta = -\frac{\delta H}{\delta q}, \tag{4.13}$$

where

$$H = \frac{1}{4}\int_{\mathbb{R}^2} e^{-|q(\xi,t) - q(\eta,t)|}\,p(\xi,t)\,p(\eta,t)\,d\xi\,d\eta. \tag{4.14}$$

Therefore, (4.12), (4.13) is a Hamiltonian system with constraint given by (4.10). (See
Remark 4.1 (b) and the appendix for more details.) If $q(\xi,t)$, $p(\xi,t)$ satisfy (4.12),
(4.13) and we define

$$K(\xi,\eta;t) = \frac{1}{2}\,e^{-\frac{1}{2}|q(\xi,t) - q(\eta,t)|}\sqrt{p(\xi,t)\,p(\eta,t)}, \tag{4.15}$$

then a direct calculation shows that $K(\xi,\eta;t)$ evolves under the integro-differential
equation (2.24). Note that the kernel $K(\cdot,\cdot\,;t)$ defined in (4.15) is a positive single-pair
kernel. Moreover, $K(\cdot,\cdot\,;t) \in C(\mathbb{R}^2) \cap L^2(\mathbb{R}^2)$ and the corresponding operator
is of trace class. (See Remark 4.1 (b).)

Remark 4.1. (a) For the class of initial data which we consider here, it has been
established in [R] that the solution $u$ of the Cauchy problem associated to the CH
equation (4.1) satisfies $u \in C^\infty((0,\infty), \mathcal{S}(\mathbb{R}))$.

(b) On the other hand, by the ODE for $q(\xi,t)$ in (4.4) (and Remark 4.1 (a)) or
otherwise, we know that $q(\cdot,t) \in C^\infty(\mathbb{R})$ and hence $p(\cdot,t)$ is also in $C^\infty(\mathbb{R})$ by
(4.10). Now it follows from (4.12) that

$$\dot{q}_\xi(\xi,t) = \left( -\frac{1}{2}\int_{\mathbb{R}} \operatorname{sgn}(\xi - \eta)\,e^{-|q(\xi,t) - q(\eta,t)|}\,p(\eta,t)\,d\eta \right) q_\xi(\xi,t).$$

Solving, we find

$$q_\xi(\xi,t) = \exp\left( -\frac{1}{2}\int_0^t \int_{\mathbb{R}} \operatorname{sgn}(\xi - \eta)\,e^{-|q(\xi,\tau) - q(\eta,\tau)|}\,p(\eta,\tau)\,d\eta\,d\tau \right). \tag{$**$}$$

Since $P = \int_{\mathbb{R}} p(\eta,t)\,d\eta$ is a conserved quantity [C1], we obtain from ($**$) that
$e^{-Pt/2} \le q_\xi(\xi,t) \le e^{Pt/2}$, $\xi \in \mathbb{R}$, $t > 0$. Consequently, for each $t > 0$, the function
$q(\cdot,t)$ is a diffeomorphism of the line which is strictly increasing and satisfies
$\lim_{\xi \to \pm\infty} q(\xi,t) = \pm\infty$. Moreover, $q(\cdot,t)$ is a tempered distribution. Now, if we
continue by differentiating ($**$) repeatedly, we can show by induction that there
exist constants $C_k(t) > 0$ for $t > 0$ such that $|\partial_\xi^k q(\xi,t)| \le C_k(t)$, $k = 2, 3, \ldots$. Since
$m(\xi,0) = (w_0'(\xi))^2$ is a rapidly decreasing function, it is now easy to see by using
these bounds that $p(\cdot,t)$ as defined in (4.10) is also a rapidly decreasing function.

(c) We will show that the map $(q,p) \mapsto j(q,p) = \tfrac{1}{2} e^{-\frac{1}{2}|q(\xi) - q(\eta)|}\sqrt{p(\xi)\,p(\eta)}$ is a
Poisson map in the appendix. (This is where the discussion in (b) above will be
used in setting up the domain of the map.) However, it is important to point out
that the image of this map is not invariant under $\mathrm{Ad}^*_{G_R}$, although it is invariant
under the CH-flow.

To solve the Cauchy problem

$$u_t - u_{xxt} + 3uu_x = 2u_x u_{xx} + uu_{xxx}, \qquad u(x,0) = u_0(x) \tag{4.16}$$

in the space $\mathcal{S}(\mathbb{R})$ under the above assumptions, we proceed as follows. From the
initial data $u_0$ which satisfies assumptions (i) and (ii) above, we obtain the initial
conditions for $q(\xi,t)$ and $p(\xi,t)$:

$$q(\xi,0) = \xi, \qquad p(\xi,0) = m(\xi,0) = w_0'(\xi)^2. \tag{4.17}$$

From this, we obtain the kernel $K_0(\xi,\eta)$ of the operator $K_0$, namely,

$$K_0(\xi,\eta) = \frac{1}{2}\,e^{-\frac{1}{2}|\xi - \eta|}\,w_0'(\xi)\,w_0'(\eta). \tag{4.18}$$

Remark 4.2. Note that from our assumption on the initial data $u_0$ and (4.18)
above, it is clear that $K_0 \in \mathfrak{p}^*$. Hence it follows from Proposition 2.8 that the
solution $K(t)$ of (3.22) is on the coadjoint orbit $\mathcal{O}_{K_0} = \{\, \mathrm{Ad}^*_{G_R}(g)K_0 \mid g \in G_R \,\} \subset \mathfrak{p}^*$
for all $t$.

Now we solve the initial value problem (3.22) whose solution K(t) is given by the 
formula in (3.21) and we denote the kernel corresponding to K{t) by K(£,r), ;t). 
Since K(£, r], ; t) is given by the formula in (4.15) where q(£, t), p(£, t) are solutions 
of (4.12), (4.13) with initial condition given in (4.17), we can recover p(r/,t) from 
the formula 

p( V ,t) = 2K{ri,r l ;t). (4.19) 
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From (4.15) and (4.19), it follows that 



-\q(t,t)-q( V ,t)\ = K(^7];t) 2 



(4.20) 



Hence we obtain 

«^=/^£« ,, 21) 

Jr K(r],r);t) 

and consequently, we can determine q(r),t) from the formula 

q(v, t)=V+ f(( *WT^ d() dr. (4.22) 

Jo \Jr k{v,v;t) ) 

Finally, we obtain the solution of the Cauchy problem (4.16) from

$$u(x,t) = \frac{1}{2}\int_{\mathbb{R}} e^{-|x-q(\eta,t)|}\, p(\eta,t)\, d\eta \tag{4.23}$$

where $p(\eta,t)$ and $q(\eta,t)$ are given in (4.19) and (4.22) above.
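The recovery of $p$, the velocity of $q$ and $u$ from the kernel can be illustrated numerically. The following is a minimal sketch, not the method of the paper: it samples a kernel of the assumed form $\frac{1}{2}e^{-\frac{1}{2}|q(\xi)-q(\eta)|}\sqrt{p(\xi)p(\eta)}$ on a uniform grid, replaces the integrals by Riemann sums, and uses a hypothetical Gaussian $p$ at $t = 0$. All function names are illustrative.

```python
import math

def kernel_from_qp(q, p):
    """Sampled kernel, assuming K = 1/2 * exp(-|q_i - q_j|/2) * sqrt(p_i * p_j)."""
    n = len(q)
    return [[0.5 * math.exp(-0.5 * abs(q[i] - q[j])) * math.sqrt(p[i] * p[j])
             for j in range(n)] for i in range(n)]

def recover_p(K):
    """Formula (4.19): p(eta) = 2 K(eta, eta)."""
    return [2.0 * K[j][j] for j in range(len(K))]

def q_velocity(K, h):
    """Riemann-sum version of (4.21): dq/dt(eta_j) ~ h * sum_i K_ij^2 / K_jj."""
    n = len(K)
    return [h * sum(K[i][j] ** 2 for i in range(n)) / K[j][j] for j in range(n)]

def u_from_qp(x, q, p, h):
    """Riemann-sum version of (4.23): u(x) ~ (h/2) * sum_j exp(-|x - q_j|) * p_j."""
    return 0.5 * h * sum(math.exp(-abs(x - qj)) * pj for qj, pj in zip(q, p))

# Hypothetical sample data at t = 0: q(xi, 0) = xi and a Gaussian p(xi, 0).
h = 0.1
xi = [-8.0 + h * j for j in range(161)]
q0 = list(xi)
p0 = [math.exp(-x * x) for x in xi]

K0 = kernel_from_qp(q0, p0)
p_rec = recover_p(K0)      # should reproduce p0
v = q_velocity(K0, h)      # should agree with u evaluated at the points q0[j]
```

On the grid, `v[j]` and `u_from_qp(q0[j], q0, p0, h)` are the same Riemann sum written in two ways, which is the discrete counterpart of the relation between (4.21) and (4.23).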

Note that the explicit formula for $K(\xi,\eta;t)$ is given in (3.36) where in the present
case we can interpret equality in the pointwise sense as the kernels of all operators
involved in (3.35) are continuous. Alternatively, we can make use of Mercer's theorem
in writing down the explicit formula for $K(\xi,\eta;t)$. In this connection, note
that $K_0$ is compact and hence its spectrum $\sigma(K_0)$ is discrete with no limit points
except possibly at 0. Moreover, $K_0$ has no negative eigenvalues. To see this, let
$\lambda$ be an eigenvalue of $K_0$ and $\phi$ a corresponding normalized eigenfunction. Then
from

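$$\frac{1}{2}\int_{\mathbb{R}} e^{-\frac{1}{2}|\xi-\eta|}\, w'(\xi)\, w'(\eta)\, \phi(\eta)\, d\eta = \lambda\, \phi(\xi), \tag{4.24}$$

we have

$$\lambda = \frac{1}{2}\int_{\mathbb{R}} \left( \int_{\mathbb{R}} e^{-\frac{1}{2}|\xi-\eta|}\, w'(\eta)\, \phi(\eta)\, d\eta \right) w'(\xi)\, \phi(\xi)\, d\xi. \tag{4.25}$$

Therefore, on applying the Parseval formula and the convolution theorem for Fourier
transforms to the right hand side of (4.25), we obtain$^1$

$$\lambda = \frac{1}{\pi}\int_{\mathbb{R}} \frac{|\widehat{w'\phi}(k)|^2}{1+4k^2}\, dk \tag{4.26}$$

from which the positivity is clear. (Here $\widehat{w'\phi}(k) = \int_{\mathbb{R}} e^{-ik\xi}\, w'(\xi)\phi(\xi)\, d\xi$
is the Fourier transform of $w'\phi$.)
Hence this shows that $K_0$ has no negative eigenvalues. To give an estimate of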



1 We owe this argument and the sharpening of (4.28) to the referee. 
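$\sigma(K_0)$, note that the integral on the right hand side of (4.25) obeys the inequalities

$$\begin{aligned}
\frac{1}{2}\int_{\mathbb{R}} \left( \int_{\mathbb{R}} e^{-\frac{1}{2}|\xi-\eta|}\, w'(\eta)\, \phi(\eta)\, d\eta \right) w'(\xi)\, \phi(\xi)\, d\xi
&\le \frac{1}{2}\left( \int_{\mathbb{R}} |w'(\xi)\, \phi(\xi)|\, d\xi \right)^2 \\
&\le \frac{1}{2}\int_{\mathbb{R}} (w'(\xi))^2\, d\xi = \frac{P}{2}
\end{aligned} \tag{4.27}$$

where we have used the Cauchy-Schwarz inequality and the normalization of $\phi$ in
going from the second line to the last one in (4.27) above. Combining the above
analysis, we can now conclude that

$$\sigma(K_0) \subset [0, P/2]. \tag{4.28}$$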
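The positivity of the spectrum and the upper bound can be checked on a discretized version of $K_0$. The following is a rough numerical sketch rather than part of the argument: it assumes the kernel $\frac{1}{2}e^{-\frac{1}{2}|\xi-\eta|}w'(\xi)w'(\eta)$, takes $P$ to mean $\int_{\mathbb{R}} m(\xi,0)\,d\xi$, and uses a hypothetical Gaussian profile for $w'$. The largest eigenvalue is estimated by power iteration, and the helper names are illustrative.

```python
import math

def discretized_K0(w_prime, xs, h):
    """Matrix h * K0(x_i, x_j), assuming K0 = 1/2 exp(-|x_i - x_j|/2) w'(x_i) w'(x_j)."""
    w = [w_prime(x) for x in xs]
    n = len(xs)
    return [[0.5 * h * math.exp(-0.5 * abs(xs[i] - xs[j])) * w[i] * w[j]
             for j in range(n)] for i in range(n)]

def matvec(M, v):
    """Plain matrix-vector product."""
    return [sum(mij * vj for mij, vj in zip(row, v)) for row in M]

def quad_form(M, v):
    """Quadratic form v^T M v."""
    return sum(a * b for a, b in zip(v, matvec(M, v)))

def top_eigenvalue(M, iters=200):
    """Power iteration; returns the Rayleigh quotient of the final unit iterate."""
    v = [1.0] * len(M)
    for _ in range(iters):
        w = matvec(M, v)
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return quad_form(M, v)

def w_prime(x):
    """Hypothetical choice of w', so that m(x, 0) = exp(-x^2)."""
    return math.exp(-0.5 * x * x)

h = 0.2
xs = [-6.0 + h * j for j in range(61)]
M = discretized_K0(w_prime, xs, h)

lam_max = top_eigenvalue(M)
P = h * sum(w_prime(x) ** 2 for x in xs)          # Riemann sum for the integral of m(x, 0)
alternating = [(-1.0) ** j for j in range(len(xs))]
q_alt = quad_form(M, alternating)                 # should be nonnegative up to rounding
```

In this discrete setting the bound `lam_max <= P / 2` is simply the statement that the largest eigenvalue of a positive semidefinite matrix does not exceed its trace, which mirrors the continuous estimate.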

Let $\sigma(K_0) = \{\lambda_i\}_{i=1}^{\infty}$ and let $\{\phi_i\}_{i=1}^{\infty}$ be the corresponding normalized
eigenfunctions. Then we have

$$K_0 = \sum_{i=1}^{\infty} \lambda_i\, \phi_i \otimes \phi_i \tag{4.29}$$

so that from (3.21), (3.20) and the orthogonality of $b_+(t)$, we find

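$$\begin{aligned}
K(t) &= \sum_{i=1}^{\infty} \lambda_i\, (b_+(t)^{-1}\phi_i) \otimes (b_+(t)^{-1}\phi_i) \\
&= \sum_{i=1}^{\infty} \lambda_i e^{-t\lambda_i}\, (b_-(t)^{-1}\phi_i) \otimes (b_-(t)^{-1}\phi_i).
\end{aligned} \tag{4.30}$$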



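Since $\sigma(K(t)) = \sigma(K_0)$, the series
$\sum_{i=1}^{\infty} \lambda_i\, (b_+(t)^{-1}\phi_i)(\xi)\, (b_+(t)^{-1}\phi_i)(\eta)$
converges absolutely and uniformly on compact sets and it follows from (4.30) and
Mercer's theorem that

$$K(\xi,\eta;t) = \sum_{i=1}^{\infty} \lambda_i e^{-t\lambda_i}\, (b_-(t)^{-1}\phi_i)(\xi)\, (b_-(t)^{-1}\phi_i)(\eta) \tag{4.31}$$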

where from (3.31) and (3.32), 

(&_(*) <f>i)(x) = <f>i(x)- \S(y,x;t) + 7 _ ' r <M?/)<%- 

J —oo \ det2 |(— oo,x)J J 

(4.32) 

Remark 4.3. Under a different assumption on $m(\xi,0)$ and using an entirely different
method, the Lagrangian form of the CH equation was integrated in terms of certain
Fredholm determinants in [M]. In particular, the spectral problem in [M] is given
by

$$\left( \tfrac{1}{4} - \partial_x^2 \right) f(x,t) = \lambda\, m(x,t)\, f(x,t). \tag{4.33}$$

On the other hand, our spectral problem reads:

$$\frac{1}{2}\int_{-\infty}^{\infty} e^{-\frac{1}{2}|q(\xi,t)-q(\eta,t)|} \sqrt{p(\xi,t)\,p(\eta,t)}\; \phi(\eta,t)\, d\eta = \lambda\, \phi(\xi,t). \tag{4.34}$$

It is a natural question to ask if (4.34) can be derived from (4.33). To put it
differently, can we derive the kernel (4.15) (as discovered in [C1]) from (4.33)? As
the referee pointed out to us, the answer is yes. Indeed, under the
assumption in [M], (4.33) can be rewritten as

$$\int_{-\infty}^{\infty} e^{-\frac{1}{2}|x-y|}\, m(y,t)\, f(y,t)\, dy = \frac{f(x,t)}{\lambda}. \tag{4.35}$$

Then the change to the Lagrangian variable $x = q(\xi,t)$ gives

$$\int_{-\infty}^{\infty} e^{-\frac{1}{2}|q(\xi,t)-q(\eta,t)|}\, p(\eta,t)\, f(q(\eta,t),t)\, d\eta = \frac{f(q(\xi,t),t)}{\lambda} \tag{4.36}$$

where we have invoked the definition of $p(\cdot,t)$ in (4.10). For our class of initial data,
$p(\xi,t) > 0$ for all $t$. Therefore, if we multiply both sides of (4.36) by $\sqrt{p(\xi,t)}$ and
set $\phi(\xi,t) = f(q(\xi,t),t)\sqrt{p(\xi,t)}$, the result is (4.34) above (modulo a factor $\frac{1}{2}$)
provided we replace $\lambda$ by $\frac{1}{\lambda}$.
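The passage from the differential form (4.33) to the integral form (4.35) rests on the Green's function of $\frac{1}{4} - \partial_x^2$ on the line. As a brief sketch, assuming that $f$ and $mf$ decay at infinity so that the inverse operator acts by convolution:

```latex
\begin{align*}
\Big( \tfrac{1}{4} - \partial_x^2 \Big)\, e^{-\frac{1}{2}|x-y|} &= \delta(x-y), \\
f(x,t) = \lambda \Big( \tfrac{1}{4} - \partial_x^2 \Big)^{-1} (m f)(x,t)
  &= \lambda \int_{-\infty}^{\infty} e^{-\frac{1}{2}|x-y|}\, m(y,t)\, f(y,t)\, dy .
\end{align*}
```

The first line is the general fact that $e^{-a|x|}/(2a)$ is the decaying fundamental solution of $a^2 - \partial_x^2$, specialized to $a = \frac{1}{2}$. Dividing the second line by $\lambda$ gives the integral form.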
Appendix

Let $U = \mathcal{S}'(\mathbb{R}) \cap \{\text{strictly increasing diffeomorphisms } q : \mathbb{R} \to \mathbb{R}\}$, where $\mathcal{S}'(\mathbb{R})$
is the space of tempered distributions, and let $V$ be the subset of $\mathcal{S}(\mathbb{R})$ consisting
of those $p \in \mathcal{S}(\mathbb{R})$ which are strictly positive. We equip $U \times V$ with the Poisson
bracket

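$$\{F_1, F_2\}(q,p) = \int_{\mathbb{R}} \left( \frac{\delta F_1}{\delta q}\frac{\delta F_2}{\delta p} - \frac{\delta F_1}{\delta p}\frac{\delta F_2}{\delta q} \right) d\xi.$$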

For $(q,p) \in U \times V$, put

Proposition A. The map 7 : U x V — ► $3^ ~ g^ defined by 

$$\gamma(q,p)(\xi,\eta) = \frac{1}{2} e^{-\frac{1}{2}|q(\xi)-q(\eta)|} \sqrt{p(\xi)\,p(\eta)}$$

is a Poisson map.
Proof. We want to show

$$\{F_1 \circ \gamma, F_2 \circ \gamma\}(q,p) = \{F_1, F_2\}_R(\gamma(q,p)).$$

To compute the Fréchet derivatives, we proceed as follows. First of all,

By using the definition of $\gamma$, a straightforward computation shows that

i(q + eq,p) = 



±e + (0£-(v)(q(0-q(v)), £<v, 

±e+(r,)e-(Z)(q(r,)-q(0), £ > V- 
Substituting this into (†), it follows after some manipulation that



d_ 

de 



6=0 



= ^ + (0((n ( ^( 7 ))*£_)(0 -£_(6((n ( ^( 7 )K + )(0 

where $dF_i(\gamma)$ is the shorthand for $dF_i(\gamma(q,p))$, which we will use from now onwards.
In a similar way, we can show that



d_ 

de 



7(q,p + ep) 

e=0 
and

Next, we substitute the Fréchet derivatives into the expression for the Poisson
bracket between $F_1 \circ \gamma$ and $F_2 \circ \gamma$; this gives

{Ft o7,F 2 °7}(?,ri 

=\ f [((n [ dF 1 ( 7 ))^_)(0((n l dF 2 ( 7 )K + )(0 
-((n [ dF 1 ( 7 )K + )(0((n 1 dF 2 ( 7 ))^_)(0 

= j [((n [ dF 1 ( 7 ))*£_)(0((n [ dF 2 ( 7 ))£ + )(a 

-((U l dF 1 ( 1 ))£ + )(OmidF 2 (^ri-)(0]d^. 
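But now by using the fact that $\Pi_{\mathfrak{l}}\, dF_1(\gamma),\ \Pi_{\mathfrak{l}}\, dF_2(\gamma) \in \mathfrak{l}$, we can show that

$$\int_{\mathbb{R}} \big((\Pi_{\mathfrak{l}}\, dF_1(\gamma))^* e_-\big)(\xi)\, \big((\Pi_{\mathfrak{l}}\, dF_2(\gamma))\, e_+\big)(\xi)\, d\xi = \big(\gamma(q,p),\ \Pi_{\mathfrak{l}}\, dF_1(\gamma) \circ \Pi_{\mathfrak{l}}\, dF_2(\gamma)\big).$$

Interchanging the indices 1 and 2, we obtain

$$\int_{\mathbb{R}} \big((\Pi_{\mathfrak{l}}\, dF_2(\gamma))^* e_-\big)(\xi)\, \big((\Pi_{\mathfrak{l}}\, dF_1(\gamma))\, e_+\big)(\xi)\, d\xi = \big(\gamma(q,p),\ \Pi_{\mathfrak{l}}\, dF_2(\gamma) \circ \Pi_{\mathfrak{l}}\, dF_1(\gamma)\big).$$

Therefore, when we subtract the second expression from the first, the result is

$$\{F_1 \circ \gamma, F_2 \circ \gamma\}(q,p) = \big(\gamma(q,p),\ [\Pi_{\mathfrak{l}}\, dF_1(\gamma),\ \Pi_{\mathfrak{l}}\, dF_2(\gamma)]\big). \tag{‡}$$

To complete the proof, note that $\big(\gamma(q,p),\ [\Pi_{\mathfrak{k}}\, dF_1(\gamma),\ \Pi_{\mathfrak{k}}\, dF_2(\gamma)]\big) = 0$ as $\gamma(q,p) \in \mathfrak{p}$.
Consequently, when we add $-\big(\gamma(q,p),\ [\Pi_{\mathfrak{k}}\, dF_1(\gamma),\ \Pi_{\mathfrak{k}}\, dF_2(\gamma)]\big)$ to the right hand side
of (‡), the assertion follows. □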